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ABSTRACT 

We present an effective implementation of analytical calculations of the Lya 
opacity distribution of the Intergalactic Medium (IGM) along multiple lines 
of sight (LOS) to distant quasars in a cosmological setting. The method as- 
sumes that the distribution of neutral hydrogen follows that of an under- 
lying dark matter density field and that the density distribution is a (lo- 
cal) lognormal distribution. It fully accounts for the expected correlations 
between LOS and the cosmic variance in the large-scale modes of the dark 
matter distribution. Strong correlations extending up to ~ 300 kpc (proper) 
and more are found at redshift z ~ 2 — * 3, in agreement with observa- 
tions. These correlations are investigated using the cross-correlation coeffi- 
cient and the cross-power spectrum of the flux distribution along different 
LOS and by identifying coincident absorption features as fitted with a Voigt 
profile fitting routine. The cross-correlation coefficient between the LOS can 
be used to constrain the shape-parameter T of the power spectrum if the 
temperature and the temperature density relation of the IGM can be de- 
termined indepedently. We also propose a new technique to recover the 3D 
linear dark matter power spectrum by integrating over ID flux cross-spectra 
which is complementary to the usual 'differentiation' of ID auto-spectra. The 
cross-power spectrum suffers much less from errors uncorrelated in differ- 
ent LOS, like those introduced by continuum fitting. Investigations of the 
flux correlations in adjacent LOS should thus allow to extend studies of the 
dark matter power spectrum with the Lya forest to significantly larger scales 
than is possible with flux auto-power spectra. 30 pairs with separation of 
1-2 arcmin should be sufficient to determine the ID cross-spectrum at scales of 
60 h" 1 Mpc with an accuracy of about 30% (corresponding to a 15% error of 
the rms density fluctuation amplitude) if the error is dominated by cosmic 
variance. 

Key words: Cosmology: theory - intergalactic medium - large-scale structure 
of universe - quasars: absorption lines 



1 INTRODUCTION 

The current understanding of QSO spectra blueward of Lya emission, the so-called Lya forest, is based on the idea 
that the Lya absorption is produced by the inhomogenous distribution of the Intergalactic Medium (IGM) along the 
line of sight (Bahcall & Salpeter 1965; Gunn & Peterson 1965). The IGM is thereby believed to be warm (~ 10 4 K) 
and photoionized. The rather high flux of the ultraviolet background radiation results in a small neutral hydrogen 
fraction with Lya optical depth of order unity which is responsible for a 'fluctuating Gunn-Peterson effect' (see e.g. 
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Rauch 1998, for a review). Such a fluctuating Gunn- Peterson effect arises naturally in standard hierarchical models 
for structure formation where the matter clusters gravitationally into filamentary and sheet-like structures. The low- 
column density (Nhi < 10 14 ' 5 cm~ 2 ) absorption lines are generated by local fluctuations in the IGM, which smoothly 
trace the mildly non-linear dark matter filaments and sheets on scales larger the Jeans scale of the photoionized IGM 
(Cen et al. 1994, Miralda-Escude et al. 1996, Zhang et al. 1998). 

This picture is supported by analytical studies based on simple models for the IGM dynamics. Various models 
of this kind have been proposed, based on either a local non-linear mapping of the linear density contrast, such as 
the lognormal model (Coles & Jones 1991), applied to the IGM dynamics (Bi, Borner & Chu 1992; Bi 1993; Bi, Ge 
& Fang 1995; Bi & Davidsen 1997, hereafter BD97), or on suitable modifications of the Zel'dovich approximation 
(Zel'dovich 1970) to account for the smoothing caused by gas pressure on the baryon Jeans scale (Reisenegger & 
Miralda-Escude 1995; Gnedin & Hui 1996; Hui, Gnedin & Zhang 1997, Matarrese & Mohayaee 2001). 

The most convincing support for this picture comes, however, from the comparison of simulated spectra produced 
from hydrodynamical numerical simulations with observed spectra (Cen et al. 1994; Zhang, Anninos & Norman 1995, 
1997; Miralda-Escude et al. 1996; Hernquist et al. 1996; Charlton et al. 1997; Theuns et al. 1998). The numerical 
simulations have been demonstrated to reproduce many observed properties of the Lya forest very well. Simple 
analytic schemes, as the one developed here, can be calibrated by the results of numerical simulations. They then 
become an important complementary tool for studying the Lya forest. They can be used to explore larger regions of 
model parameter space and can better account for the cosmic variance of large-scale modes. These are poorly probed 
by existing hydro-simulations which have to adopt relatively small computational boxes in order to resolve the Jeans 
scale of the warm photoionized IGM. 

Observationally, the unprecedented high resolution observations of the Keck HIRES spectrograph and the UV 
spectroscopic capabilities of the HST have been instrumental in shaping our current understanding of Lya forest. 
HIRES allowed to detect lines with column densities as low as Nhi ~ 10 12 cm~ 2 while HST made a detailed analysis 
of the low-redshift Lya forest at z < 1.6 possible. From the study of absorption spectra along single lines of sight 
(LOS) to distant QSOs we have, for example, gained important information on the baryon density of the Universe 
(Rauch et al. 1997) and on the temperature and equation of state of the IGM (Schaye et al. 2000). Another important 
application is the determination of shape and amplitude of the power spectrum of the spatial distribution of dark 
matter at redshift z ~ 3, from the fluctuating Lya flux, which places important constraints on the parameters of 
structure formation models (Croft et al. 1998, 1999, 2000; Nusser & Haehnelt 1999, 2000; White & Croft 2000; 
Narayanan et al. 2000). 

In this paper, we will concentrate on the information which can be extracted from the expected flux-correlations 
in adjacent LOS (see Charlton et al. 1997 for an analysis of hydro simulations). Observations of multiple systems are 
an excellent tool to probe the actual 3D distribution of matter in the Universe and to give estimates of the size of 
the absorbing structurs. Another important application of this type of study is to constrain the global geometry of 
the Universe (Hui, Stebbins & Buries (1999) and McDonald & Miralda-Escude (1999)). 

There is a number of cases in which common absorption systems in spatially separated LOS have been observed. 
These are either multiple images of gravitationally lensed quasars (Foltz et al. 1984; Smette et al. 1992, 1995; Rauch 
et al. 1999) or close quasar pairs (Bechtold et al. 1994; Dinshaw et al. 1995, 1995, 1997; Fang et al. 1996; Crotts & 
Fang 1998; D'Odorico et al. 1998; Petitjean et al. 1998; Williger et al. 2000; Liske et al. 2000). The results concerning 
the typical size of the absorbing structures are somewhat controversial. Crotts & Fang (1998) analysed a total number 
of five QSOs in close groupings: a pair and a triplet, using Keck and HST data, with different separations ranging 
from 9.5 to 177 arcsec (corresponding to a proper distance of 40-700 hT 1 kpc in an Einstein-de Sitter Universe) 
in a redshift range 0.48 < z < 2.52. For the strongest lines identified by Voigt profile fitting they found a tight 
correspondence between lines in different LOS up to a proper separation of 0.5 — 0.8 /i _1 Mpc. Their estimate of the 
size of the absorbers using a Bayesian model (Fang et al. 1996) with the assumption that the absorbers are spherical 
with uniform radius did, however, show a dependence on the separation of the QSO pair. This suggests that the 
assumption of a spherical absorber is not correct, and that the absorbers are elongated or sheet-like (Charlton et al. 
1997; see also Rauch & Haehnelt 1995, for an independent argument). A similar analysis by D'Odorico et al. (1998) 
of several QSO pairs with a median redshift of z = 2.13, gave a radius of a few hundred kpc. D'Odorico et al. used 
the same Bayesian model as Crott & Fang and found it impossible to distinguish between a population of disk-like 
absorbers and a population of spherical clouds with different radii. Petitjean et al. (1998) analysed HST observations 
of a QSO pair over a redshift range 0.833 < z < 1.438, and obtained a typical size of the Lya absorber of 500/i _1 
kpc. Liske et al. (2000) investigated a system of 10 QSOs concentrated in a field of 1-deg 2 over the redshift range 
2.2 < z < 3.4. They found correlations across lines of sight with proper separation < 3/i -1 Mpc. Williger et al. (2000) 
investigated a grouping of 10 QSOs in the redshift range 2.15 < z < 3.37 and found a correlation length up to 26 h^ 1 
comoving Mpc. More recently, Young et al. (2000) have analysed a triple system and have found a coherence length 
of 0.5 — 1 Mpc for a redshift range 0.4 < z < 0.9. 



© 0000 RAS, MNRAS 000, 000-000 



Probing the Intergalactic Medium with the Lya forest along multiple lines of sight 3 



Here we will use an analytical method to calculate the Lya opacity distribution of the IGM along multiple LOS 
to distant quasars in a cosmological setting. From these we calculate absorption spectra with varying transverse 
separation between LOS pairs. We then use the cross-correlation coefficient, as a measure of the characteristic size of 
the absorber, which better describes the complicated geometrical structure of the absorbers suggested by numerical 
simulations. We further investigate the virtues of the flux cross-power spectrum in constraining the underlying mass 
density field. To make connections with the observational studies mentioned above we also perform an analysis of 
coincident absorption lines as identified with the Voigt profile routine AUTOVP, kindly provided by Romeel Dave. 

The plan of the paper is as follows. Section 2 presents the lognormal model for IGM dynamics and describes 
the algorithm which allows us to simulate spatially correlated LOS through the Lya forest. In Section 3 we give the 
relations which are used to simulate the Lya flux from the IGM local density and peculiar velocity fields and we use 
the cross-correlation coefficient and the cross-power spectra to quantify the flux correlations. In Section 4 we perform 
the coincidence analysis of absorption lines fitted with the Voigt-profile fitting procedure. In Section 5 we propose 
a new procedure for recovering the 3D dark matter power spectrum by integrating the ID cross-spectra over the 
transverse separation and we show the main advantages in using the cross-spectra information. Section 6 contains a 
discussion and our conclusions. 



2 METHOD 

2.1 The lognormal model of the IGM 

We implement here the model introduced by Bi and collaborators (Bi et al. 1992, 1995; Bi 1993; BD97), to simulate 
low column-density Lya absorption systems along the LOS, which we then extend to simulate multiple LOS to 
distant QSOs. This simple model predicts many properties, such as the column density distribution function and 
the distribution of the b-parameter, which can be directly compared with observations (BD97). Recently, the BD97 
model has been used by Roy Choudhury et al. (2000, 2001) to study neutral hydrogen correlation functions along 
and transverse to LOS. Feng & Fang (2000) also adopted the BD97 method to analyse non-Gaussian effects in the 
Lya transmitted flux stressing their importance for the reconstruction of the initial mass density field. 

The BD97 model is based on the assumption that the low-column density Lya forest is produced by smooth 
fluctuations in the intergalactic medium which arise as a result of gravitational instability. Linear density perturbations 
of the intergalactic medium <5j GM (x, z), can be related to DM linear overdensities by a convolution. In Fourier space 
one usually assumes 



^^'^ = l +j/k*(z) = WGM(fc, 2 )D + ( 2 )<5? M (k) (1) 

where D + {z) is the linear growing mode of dark matter density fluctuations (normalized so that D + (0) = 1) and 
^^"^(k) is the Fourier transformed DM linear overdensity at z — 0. The low-pass filter Wigm(M) = (1 + k 2 /^)' 1 
depends on the comoving Jeans length 

1 1/2 



k-\z) = H^ 



2 7 fc B r (z) 
3/im p f2 m(I + z) 



(2) 



with fcs the Boltzmann constant, To the temperature at mean density, fi the molecular weight of the IGM, Qom 
the present-day matter density parameter and 7 the ratio of specific heats. Gnedin & Hui (1998) adopt a different 
and more accurate expression for the IGM filter Wigm(A;, z), which, however, does not allow a simple matching with 
the non-linear regime. More accurate window-function have also been proposed by Nusser (2000) and Matarrese & 
Mohayaee (2001). In what follows we take To (z) oc 1 + z, which leads to a constant comoving Jeans scale. This 
assumption should not be critical as the redshift intervals considered here are small. 

Given the simple relation between the IGM and DM linear density contrasts, one gets the following relation 
between the corresponding linear power spectra, Po GM (fc, z) = D 2 _(z)Wi GM (k, z)P(k), where P(k) is the DM power 
spectrum linearly extrapolated to z = 0. 

To enter the non-linear regime, BD97 adopt a simple lognormal (LN) model (Coles & Jones 1991) for the IGM 
local density, 



niGM(x, z) = 71igm(z) (l + S 1 M (x, 2)) = ni GM (z) exp 



f IGM / \ 
S (x, z) - 



({8\™f)D\{z) 



(3) 



where Tiigm(z) « 1.12 x 10 5 Q bh 2 (l + zf cm~ 3 . 

As stressed by BD97, the LN model for the IGM has two important features: on large scales, k <JC kj, it reduces 
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to the correct linear evolution, while on strongly non-linear scales, k 3> kj, it behaves locally like the isothermal 
hydrostatic solution for the intra-cluster gas (e.g. Bahcall & Sarazin 1978), wigm ~ TiiaM exp[—(fim p ^o /jkBT m )], 
where $o is the linear peculiar gravitational potential. 

The IGM peculiar velocity v IGM is related to the linear IGM density contrast via the continuity equation. As in 
BD97, we assume that the peculiar velocity is still linear even on scales where the density contrast gets non-linear; 
this yields 

v IGM (x, z) = E + (z) J ^Le^W 1GM (k, 2 )«5 DM (k) , (4) 

with E+[z) = H(z)f(n m ,n A )D + (z)/(l + z). Here /(fi,„,fU) = -din D+(z)/rfln(l + z) (e.g. Lahav et al. 1991, for 
its explicit and general expression) and H(z) is the Hubble parameter at redshift z, 

H(z) = ff \AWl + + Ml + z) 2 + "oa (5) 

where floA is the vacuum-energy contribution to the cosmic density and J7otc = 1 — ^Om — ^oa (ficro = for a flat 
universe) . 



2.2 Line of sight random fields 

If we now draw a LOS in the xy direction, with fixed coordinate xi we obtain a set of one-dimensional random 
fields, which will be denoted by the subscript ||. Consider, for instance, the IGM linear density contrast: for a fixed 
xi we can Fourier transform it w.r.t. the xu coordinate and obtain 

5j GM (fc||, 2 |x ± ) = D+(z) J 0ie*^ ^ W,gm(^| + k\ , z)5 c ? M (fc,| , k ± ) = D+ (z)A 1GM (k\\ ,z\ xx ). (6) 
Similarly, for the IGM peculiar velocity along the xm direction, we obtain 

v\ GM (k h z\x ± ) =v IGM (fc||,2|x_L)-X|| =ik l} E+(z)U 1GM {k h z\x ± ) , (7) 
where icy is the unit vector along the LOS and 

t/ IGM (fc,|, 2 !x ± ) = J |^e Ik - Xi ^^w.oM^q + ki,z)sr(h , kj.) • (8) 

2.2.1 Line of sight auto-spectra and cross-spectra 

We now want to obtain auto and cross-spectra for these ID Gaussian random fields along single or multiple LOS. 
Given a 3D random field V'( x ) with Fourier transform f/>(k) and 3D power spectrum P(|fc|), one can define the LOS 
random field ^ii (xn , xj_) as the ID Fourier transform 

tf|,(k|||xi) = j |^L e *— V(fc||,kx). (9) 

The cross-spectrum 7r(|fc|| \\r±) for our LOS random field along parallel LOS, separated by a transverse distance r±, 
is defined by 

(^||(fe|||x_ L )V'||(A;|||x_L+r_L)) = 27rfo(A|| +fc|)7r(|fc||||r±) , (10) 
where <5d is the Dirac delta function and 7r(fc|rx) can be related to the 3D power spectrum as follows 

<*|rx) = J 0f^ TX P{VW+V) ■ (11) 
Integrating over angles and shifting the integration variable yields 

7T(fc|rx) = — J dqqJ (r ± y/(j*-k*)P{q) , (12) 

where J„ will generally denote the Bessel function of order n. 

In the limit of vanishing distance between the two LOS Jo — > 1 and the above formula reduces to the standard 
relation for the LOS (ID) auto-spectrum in terms of the 3D power spectrum (Lumsden et al. 1989) 

* In what follows we neglect the effect of the varying distance between the lines of sight. 
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p(k) = 7v(k\r ± = 0) = ±- f dqqP(q) . (13) 

J k 

The IGM linear density contrast and peculiar velocity along each LOS can be grouped together in a single 
Gaussian random vector field V(ftii[xx) with components Vi = A IGM and V2 = U IGM . One has 

(Vi(fe|||xx)V j (fcf||xx +p±)> =2tt fo(fc„ +fcf|)ir«(|fc||||rx) , t,j = 1,2, (14) 

where the 2x2 symmetric cross-spectra matrix irij(k\r±) has components 

^n(fc|rx) = — / dqqj (r ±y /q* - W)W? GU (q,z)P{q) , (15) 

^ 22 (fc|rx) = ^^ ^|jo(rx^ 2 - fc 2 )Wf GM (g )Z )P(?) , (16) 

7r 12 (fc|rx) = ^- / ^Jo(r-xVg^^)W I 2 GM ( ? ,^)P(g). (17) 

For vanishing transverse distance between the lines, Jo(0) = 1 and the auto-spectra components are given by Pij(k) = 
irij(k\r± — 0), which we will also need in the following. 



2.3 Correlation procedure 

Our next problem is how to generate the two random fields A IGM and f/ IGM in ID Fourier space (—00 < fen < 00). 
These random fields have non-vanishing cross-correlations but unlike in 3D Fourier space they cannot be related by 
simple algebraic transformations. 

We gan generally write any M-dimensional Gaussian random vector V with correlation matrix C and components 
Cij = (ViVj), as a linear combination of another M-dimensional Gaussian random vector X with diagonal correlation 
matrix^, which we can take as the identity I without any loss of generality. The transformation involves the M x M 
matrix A, with components 0%-, as follows: V = AX. One gets C = AA T , i.e. dj = ^ fc aikOtjk- There is a slight 
complication because V is a random vector field defined in ID Fourier space. We can, however, extend the above 
formalism to vector fields, assuming that X is a Gaussian vector field with white-noise power spectrum, 

(X l {k { ()X ] {k\)) = 2n 5 i:i fo(fe|, +fc'|) (18) 

(8ij is the Kronecker symbol). We then have Vi = X/j-i ai i-^-i where 3 of the 4 otij components are determined by 
the conditions J^t =1 ctikOjk = Pij- The remaining freedom (due to the symmetry of the original correlation matrix) 
can be used to simplify the calculations. A simple choice of coefficients which solves our problem is 

«11 = V Pll — P 12 /P22 , Ql2 = Pl2/^/p22 , OL1\ = , «22 = VP22 ■ (19) 



2.4 Multiple lines of sight 

It is straightforward to extend our formalism to simulate the IGM properties along parallel LOS. Let V(feii) and 
W(fc||) be two ID Gaussian random vector fields obtained as in Section 2.3, each with the same set of coefficients otij 
but starting from two independent white-noise vector fields X and Y (i.e. such that (XiYj) = 0). Then both V and 
W have the correct LOS auto-spectra by construction while their mutual cross-spectra vanish: (Vi(k^)Wj(k^)) = 0. 

Let us further define a new vector V'(fe||[rx) with components V( = X/k-i (ftfcVfe + yikWk), such that its auto 
and cross-spectra components are given by, 

<Vi(fc||)Vj(*fl)> = (Vi'(fc|||rx)^(fcf||rx)>=27rfo(fc u +fcj,)py(|fc|||) , 
(Vi(fc||)V/(*J|r±)> = 2tt fo(*n +fc||)ir y (|fc||||rx) . (20) 

The vectors V and V' will then represent our physical IGM linear fields on parallel LOS at a distance r±. They 
will be statistically indistinguishable from those obtained by drawing two parallel LOS separated by r± in a 3D 
realization of the linear IGM density and velocity fields. 

The transformation coefficients are determined by the equations l=1 {f3ikPji + yikjje) Pki = Pij and 

T Simple applications of this general 'correlation procedure' in the M=2 case are given e.g. in (Bi 1993) and (Porciani et al. 
1998). 
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Yll-i PikPkj = vrij. Once again, due to the symmetry of the cross-spectra components, we can choose one of the four 
jij coefficients arbitrarily. The explicit form of f3ij and chosen set of 7^ is 



(hi 
and 

711 = ± 



P11P22 - p\ 2 
7T21P22 — 7T22P12 



PllP22 



"Pl2 



P22 (AuA 2 2 - A\ 2 ) 



012 
P22 

712 = 



^12Pll — ^llPl2 

P11P22 -P? 2 
7T22P11 — 7T12P12 



P1IP22 



P 2 12 



(21) 



A 12 TP12 



AnA 2 2 



A 2 

A 12 



P 2 12 



A22 {pilP22-p\ 2 ) ' VP22A22 [ " " V P11P22 

721 = , 722 = \J A 22 /p 22 , (22) 

where 

An = pn - (/3npn + 2/?ii/3i 2 pi2 + , (23) 

A12 = P12 ~ fill (AnPll + /322P12) - (3l2 (/?2lP21 + /?22P22) , (24) 

^22 = P22 - (/SfiPn + 2/3 2 i/322Pi2 + /3I2P22) • (25) 

With this technique we can produce large ensembles of spatially correlated LOS pairs with both high resolution 
and large redshift extent. In this way we can fully account for the effects of cosmic variance on LOS properties. 

The same technique can be extended to obtain multiple LOS at the obvious cost of more and more complicated 
transformation coefficients. Alternatively, a two dimensional array of LOS in a region of the sky could be simulated. 
This will be described in a future paper. Here we only consider the case of LOS pairs. 



3 FLUX CORRELATIONS IN ABSORPTION SPECTRA OF QSO PAIRS 
3.1 Simulating the flux distribution of the Lya forest 

To simulate the Lya forest one needs the local neutral hydrogen density nm(x, z) and the corresponding Lya optical 
depth t(zo). In the optically thin limit, assuming photoionization equilibrium, the local density of neutral hydrogen 
can be written as a fraction fm(T, J21, n e ) of the local hydrogen density «h(x, z), which in turn is a fraction X « 0.76 
of the total baryon density tiigm(x, 2). Here F_i2, the hydrogen photoionization rate in units of 1CP 12 s _1 , is defined in 
terms of the UV photoionizing background radiation as 4x J21, where J(v) = J2i(vo/v) m x lCT 21 crg s _1 Hz _1 cm _2 sr _1 
with vq, the frequency of the HI ionization threshold, and m is usually assumed to lie between 1.5 and 1.8; n e (x, z) 
is the local number density of free electrons. In the highly ionized case (nm <S niGM) of interest here, one can 
approximate the local density of neutral hydrogen as (e.g. Hui, Gnedin & Zhang 1997) 

- 10- wo„w (%iy'" (i±i)'o + w*.)r . m 

The temperature of the low-density IGM is determined by the balance between adiabatic cooling and photoheating 
by the UV background, which establishes a local power-law relation between temperature and density, T(x, z) = 
T (z)(l + 5 IGM (x,z)) 7(z)_1 , where both the temperature at mean density To and the adiabatic index 7 depend on 
the IGM ionization history (Meiksin 1994; Miralda-Escude & Rees 1994; Hui & Gnedin 1997; Schaye et al. 2000). 
The absorption optical depth in redshift-space at u (in km s _1 ) is 

/oo 
dy n m {y) V [u - y - v l ° M {y), b{yj\ dy , (27) 

where ao, a = 4.45 x 10~ 18 cm 2 is the hydrogen Lya cross-section, H(z) is the Hubble constant at redshift z, y is 
the real-space coordinate (in km s _1 ), V is the standard Voigt profile normalized in real-space, b = (2fcgT/mc 2 ) 1 ' 2 
is the velocity dispersion in units of c. For the low column-density systems considered here the Voigt profile is well 
aproximated by a Gaussian: V = (y/iib)^ 1 cxp[— (u — y — «|| GM (2/)) 2 /& 2 ]- As stressed by BD97 peculiar velocities affect 
the optical depth in two different ways: the lines are shifted to a slightly different location and their profiles are 
altered by velocity gradients. The quantity T_i2 is treated as a free parameter, which is tuned in order to match the 
observed effective opacity t c b(z) = — ln(exp (— r)) (e.g. McDonald et al. 1999; Efstathiou et al. 2000) at the median 
redshift of the considered range (t c h = 0.12 and r e s = 0.27 at z = 2.15 and z = 3, respectively, in our case). We 
account for this constraint by averaging over the ensemble of the simulated LOS. The transmitted flux is then simply 
T — cxp(— r). Let us finally mention that Bi (1993) simulated double LOS with a simplified scheme which neglects 
the effects of peculiar velocities. 
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SCDM 




1.8xl0 4 



2xl0 4 2.2xl0 4 

vel (km/sec) 



2.4x10* 



Figure 1. Simulated QSO spectra for a SCDM model at z = 2.15. The y-axis represents the flux, the x-axis is in km s 1 . Each 
of the bottom five panels has the correct correlation properties with regard to the top panel at proper distance 0.03 Mpc/h, 
0.06 h' 1 Mpc, 0.12 h^ 1 Mpc, 0.24 /i" 1 Mpc, 0.48 h' 1 Mpc, from top to bottom. 



3.2 Absorption spectra of QSO pairs in cold dark matter models 

We have simulated a set of LOS pairs all based on the cold dark matter (CDM) model but with different values of the 
normalization, ug, vacuum energy content, Qoa = 1 — f^Om, Hubble constant h = iio/100 kms -1 Mpc -1 and spectral 
shape-parameter F. A linear power spectrum of the form P(k) oc kT 2 (k) was assumed, with T(k) the CDM transfer 
function (Bardeen et al. 1986): 

T(q) = lD(1 2 + 3 ' g 34g) x [1 + 3.89, + (16.1g) 2 + (5.46gf + (6.71#] ^ , (28) 

where q — k/hY. The shape-parameter V depends on the Hubble parameter, matter density Q 0m and baryon density 
Qob (Sugiyama 1995): 

T = Qom hexp[-Q ob - y/h/0.5Qub/n m)] ■ (29) 
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ACDM 




1.8xl0 4 2xl0 4 2.2xl0 4 2.4xl0 4 

vel (km / sec) 

Figure 2. The same as in Figure M for a ACDM model, with the same random phases. 



We have simulated a cluster-normalized Standard CDM model (SCDM) (Qom = 1, h = 0.5, as = 0.6, Slot = 0.019 ft 2 ), 
a ACDM model (fi 0m = 0.3, Q A = 0.7, ft = 0.65, n ob = 0.019 ft 2 ) and a rCDM model (Q 0m = 1, h = 0.5, cr 8 = 0.6, 
Slob = 0.019 ft 2 , T = 0.187). 

The redshift ranges of the Lya forest are 1.90 < z < 2.40 (3525 A< A < 4133A) and 2.75 < z < 3.25 (4556 
A< A < 5163A). We use ID grids with 2 14 = 16,384 equal comoving-size intervals. In the first case the box length 
is 538 comoving Mpc for the SCDM and 718 comoving Mpc for the ACDM model, while in the second case is 378 
comoving Mpc for the SCDM and 518 comoving Mpc for the ACDM. These intervals have been chosen so that the 
size of the box, expressed in km s _1 , is the same for all the models in each redshift interval. In the low-redshift case 
our box size is 47690 km s _1 , while in the high-redshift one is 37540 km s _1 . 

The adopted procedure to account for observational and instrumental effects follows closely that described in 
(Theuns, Schaye & Haehnelt 1999). We convolve our simulated spectra with a Gaussian with full width at half 
maximum of FWHM = 6.6 km s _1 , to mimic QSO spectra as observed by the HIRES spectrograph on the Keck 
telescope. We then resample each line to pixels of size 2 km s _1 . Photon and pixels noise is finally added, in such a 
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Figure 3. Auto-spectrum (top panel on the left) and three flux cross-spectra obtained with the definitions given in the text, for 
the SCDM model. The x-axis represents log A;, where k is defined as 2tt/v, the y-axis is the log of auto-spectra and cross-spectra. 
The three cross-spectra have been obtained taking 10 QSO pairs whith three different proper separations (0.1 /i -1 Mpc, 0.2 
h' 1 Mpc, 0.3 h' 1 Mpc). The dashed curve is the linear prediction of the ID auto-spectrum and cross-spectrum as given in eq. 
( |l3| ) and in eq. (|lfj). Error bars are the error of the mean value. 



way that the signal-to-noise ratio is approximately 50 but it varies as a function of wavelength and flux of observed 
QSO spectra as estimated from a spectrum of Q1107 + 485. 

Figures |l| and |^ show the transmitted flux for a sequence of LOS with varying transverse distance rj_ , for SCDM 
and ACDM model respectively. In each sequence the first LOS is kept fixed (the one on the top) and the value 
of r± varies in the cross-spectra while the phases are kept constant; the second member of each pair has thus the 
required auto and cross-correlation properties. Notice that only pairs which include the first LOS have the required 
cross-correlation properties. 

The figures are very similar because we use the same phases in both models, so that the differences can be better 
appreciated. Coherent structures extend out to hundreds of kpc/h proper (several comoving Mpc) in the direction 
orthogonal to the LOS. These can be understood as the signature of the underlying 'cosmic web' of mildly non-linear 
sheets and filaments (Bond, Kofman & Pogosyan 1996) in the dark matter distribution, which is smoothly traced by 
low-column density Lya absorption systems. 



3.3 Statistical analysis of the flux correlations 

We compute the auto-spectra pr{k) and the cross-spectra -K^(k\r±) of the flux for the two cosmological models using 
the definitions pr{k) = (\J-o(k)\ 2 ) and -K^(k\r±) = (Re(J r o(k)J r i(k,r±))), where J~o(k) and J-i(k) are the Fourier 
components of the flux along the two LOS at distance r±_ and the symbol (•) denotes the ensemble average. In Figures 
^| and ^ we plot the auto-spectrum (top panel on the left) and the three cross-spectra at proper distance 0.1 Mpc, 
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Figure 4. Same plot as in Figure H for a ACDM model 



0.2 Mpc and 0.3 h~ x Mpc. The total number of simulated LOS pairs is 30. The results are ensemble averages 
of 10 pairs at each separation and the error bars represent the scatter of the mean value. The range of k = 2n/v 
(v is the velocity in km/s) plotted here does not include the small scales (high k) strongly affected by pixel noise 
and non-linearity effects where the power spectra flattens again (Theuns, Haehnelt & Schaye 2000, McDonald et al. 
2000). The dashed line represents the theoretical prediction of the linear power spectra as given by equation 

©. The 

agreement is good over a wide range of wavenumbers k, roughly —3 ,$ logfc^S — 1. This is the interval we will use 
in Section 6 to recover the 3D power spectrum of the linear density field and is close to the range of k-wavenumbers 
used by Croft et al. (1999) from the analysis of observational data. 

It is important here to stress that the simulated spectra have been produced in redshift-space, while the theory 
is in real-space. We have checked the difference by recomputing eq. ( jlil) and eq. (|l5| ) considering also redshift-space 
distortions, i.e. using the distortion kernel proposed by Hui (1999), and the differences are negligible. Given this 
reasonably good agreement, all the following comparisons between the simulated spectra and the theory have been 
made without taking into account the redshift-space distortions in the theoretical equations. 

We have also measured the flux cross-correlation coefficient x( r i) as a function of separation, binning the data 
in bins with width Av. We choose the following definition, 

x(ri ) l^-<^«-<^», (30) 

where J-q and T\ are the fluxes of the binned spectra of pairs with separation r±, N pix is the number of pixels of the 
binned spectrum and <jt , the standard deviation of the two fluxes. 
This function can be related to the auto and cross-spectra as follows, 
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Figure 5. Cross-correlation coefficient \ plotted as a function of angular separation 9 (left panel) and comoving separation r± 
(right panel). Points are the outputs of simulated QSO pairs for a SCDM (open circles) a rCDM with T ~ 0.2 (filled circles) 
and a ACDM model (triangles). The results are averaged over 8 pairs for each 9 (15, 30, 40, 55, 65, 80). The curves in the right 
panel are the theoretical predictions of equation (j3l|). The dotted curve is for rCDM, the dashed is for ACDM while the solid 
curve is for SCDM. The error bars represent the scatter of the distribution. 

J dk e~ k2 / k sp(k) 

where fc" 1 oc (Aw) - . Note that the flux cross-correlation coefficient does not depend on the amplitude but only on 
the shape of the power spectra. Using this function we can define a transverse coherence scale r c ± as the distance 
between two LOS at which x( r c±) = 0.5. Analytical estimates of x( r ±) f° r the various CDM models can be obtained 
by replacing p and n with the IGM linear density auto and cross-spectra pn and iru in the above relation. 

We compare here three cosmological models, a SCDM, a ACDM and a rCDM model (r ~ 0.2) at seven angular 
distances (15, 30, 40, 55, 65, 80, 100 arcsec) at redshift 2.15. This corresponds to different comoving distances r± in 
the three different cosmologies (e.g. Liske 2000), 



Wo I ^ 



(l + z'f + Q 0A ] 1/2 dz' (32) 



The IGM temperature at mean density and the temperature-density relation parameter are assumed to be To = 10 4 ' 2 
K and 7 = 1.3, respectively. 

We have generated 8 pairs of spectra for each angular separation in the usual way. The results are plotted 
in Figure [| for the three models. The left and right panels show the cross-correlation coefficient against angular 
and comoving separation, respectively. The cross-correlation coefficient has been calculated directly from the whole 
spectrum (binned with Av ~ 3 km s _1 ). The three solid curves are calculated using equation (pr[), where 7r(fc|rj_) and 
p(k) are replaced with the corresponding quantities for the IGM (tth and pn). The agreement with the theoretical 
prediction (with k s ~ 27r/3 km -1 s) is reasonably good. 

Using x = 0.5 as the definition for the coherence length gives 0T0T0.04 h' 1 Mpc for SCDM, 0T3T0.03 h' 1 Mpc 
for rCDM and 0.17 ±0.05 h' 1 Mpc for ACDM (proper). The cross-correlation coefficient depends on the detailed 
shape of the IGM power spectrum at and above the Jeans length. In the SCDM model the power spectrum falls most 
steeply towards larger scales and this results in the shortest coherence length of the three models. The rCDM model 
with its flatter power spectrum on the relevant scales has a significatly larger coherence length. The ACDM model 
has an even larger coherence length due to the larger Jeans length for smaller Qom at fixed temperature (eq. 2). 
However, the plot which can be directly compared with observations is the one on the left in Figure [| which does not 
invoke any a priori assumption on the cosmological model. Unfortunately, the f2om dependence becomes negligible 
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Figure 6. The ratio of the number of coincidences to the number of coincidences plus anticoincidences, / co , as a function of 
the proper separation of QSOs in ACDM model at a median rcdshift z = 2.15. Left and right panel are for a velocity difference 
Av = 50 km/s and Av = 150 km/s, respectively. The three solid curves show f co for lines with a column density higher than 
JVjjj = 10 12 cm~ 2 , 10 13 cm~ 2 and 10 14 cm~ 2 (from top to bottom). The three solid horizontal lines represent the level of 
random coincidences as estimated from 10 pairs of uncorrelatcd LOS, for the same column density thresholds. 



if the cross-correlation coefficient is plotted against angular separation. The cross-correlation cofficient can, however, 
be used to constrain the shape of the DM power spectrum (which we have chosen here to paramerize with F) if the 
temperature To and the temperature density relation coefficient 7 are determined indepedently. 

We have also run simulations with a wider range of model parameters, changing V, flom, ""8 and the parameters 
describing the physics of the IGM, such as To, 7, r_i2. If r e // is fixed over the whole ensemble of simulations the 
dependencies on the amplitude of the power spectrum at that redshift and on T_i2 largely cancel. 

The coherence length as defined above does not rely on assumptions about the shape of the absorbers. If the 
IGM indeed traces the filaments in the underlying dark matter distribution this should be a more adequate measure 
of the 'characteristic size' of the absorbers than the usually performed coincidence analysis of absorption lines fitted 
with a Voigt profile routine (see next section). 



4 COINCIDENCE ANALYSIS OF LYa ABSORPTION LINES IN QSO PAIRS 

In this section we perform a coincidence analysis of absorption lines in the spectra along adjacent lines of sight (see 
Charlton et al. 1997 for a corresponding analysis of a hydrodynamical simulation). We have used the Voigt profile 
fitting routine AUTOVP (Dave et al. 1997) to identify and characterize the absorption lines. For this analysis we 
have generated absorption spectra of 5 QSO pairs in the range 1.9 < z < 2.4 for proper distances 0.05,0.1,0.2,0.4 
and 0.6 Ir 1 Mpc in the ACDM model only. 

Charlton et al. (1997) demonstrated that for the filamentary and sheet-like absorping structures expected in 
hierachical structure formation scenarios the characteristic absorber 'size', as determined by counting 'coincident' 
and 'anticoincident' lines in QSO pairs, will depend on the separation of the QSO pair and on the column density 
threshold used. The characteristic size determined in this way is thus of little physical meaning and difficult to 
interpret. Nevertheless, such an analysis is useful in order to make connections with published observational studies 
which usually perform such an analysis. 

We adopt the definition of Charlton et al. (1997) based on the 'hits-and-misses' statistics described in McGill 
(1990). A coincidence is defined as the case in which an absorption line is present in both spectra within a given 
velocity difference Av and above some signal-to-noise ratio. An anticoincidence is defined when a line is present in one 
but not in the other spectrum. If there are two lines within Av we count only one coincidence and no anti-coincidence, 
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Figure 7. The same as in Figure |3| but at redshift 2 = 3. 

as in Fang (1996). We also generate 5 uncorrelated LOS and compute the number of coincidences and anticoincidences 
in these 10 pairs to estimate the level of random coincidences. 

In Figure ^ we plot the quantity f co , the ratio of the number of coincidences to the sum of all coincidences and 
anticoincidences as a function of proper distance. The three curves are for different column density thresholds (10 12 , 
10 13 and 10 14 cm -2 ) and the left and right panels are for Av — 50, 150 km s _1 , respectively. The solid horizontal 
lines show the level of random coincidences as estimated from the ensemble average of 10 uncorrelated spectra. The 
curves depend strongly on the choice of the velocity difference Av. 

In Figure [7] we plot the same quantities but for spectra with a median redshift z = 3, i.e. the LOS span the 
range 2.75 < z < 3.25. At fixed redshift, f co is larger if the velocity difference Av allowed for a coincidence is larger. 
This is easily understood as the chance to get a 'hit' becomes higher. There is also a significant trend with redshift. 
With increasing redshift both f co and the level of random coincidences increase, the latter by a factor of two. These 
findings are similar to those of Charlton et al. (1997) (their Figure 2). Our level of random coincidences is somewhat 
smaller than that in Charlton et al. (1997), probably due to a different temperature which results in a larger Jeans 
scale. 

In Figure ^| we show scatter plots of the neutral hydrogen column densities of coincident lines above a column 
density threshold of 10 12 cm -2 . For the smaller separations the column densities are well correlated while for larger 
distances the column density difference rapidly increases. This is again not surprising as for large separations the 
'coincidences' occur mostly by chance. The same analysis has been done for lines found at z — 3. The result is very 
similar to the one found at z — 2.15. 

The results presented in this section depend on details of the Voigt profile fitting and the velocity difference 
and column density threshold chosen to do the 'hits-and-misses' statistics. This makes it difficult to infer physically 
meaningful properties of the absorbers, as for example their characteristic size, with these techniques. The coherence 
length defined in the last section is more useful in this respect. 



5 RECOVERING THE 3D DARK MATTER POWER SPECTRUM USING FLUX 
CROSS-SPECTRA 

5.1 A new method for obtaining the 3D dark matter spectrum from the flux auto-spectrum 

If the effect of peculiar velocities, thermal broadening and instrumental noise on the flux fluctuations at small scales 
are neglected the transmitted flux at redshift 2 in a given direction can be approximated as (e.g. Croft et al. 1998; 
Theuns et al. 1999) 
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Figure 8. Column density of neutral hydrogen, log TVhi, for pairs of coincident absorption lines identified with the Voigt profile 
fitting routine AUTOVP in a ACDM model at z = 2.15. A velocity difference of Av = 50 km/s was allowed for coincident lines. 
Four different proper separations are shown: rj_ = 0.05 h _1 Mpc, top left panel; rj_ = 0.1 h~ 1 Mpc, top right panel; rj_ = 0.2 
h _1 Mpc, bottom left panel and r±_ = 0.4 h~ 1 Mpc, bottom right panel. 



exp 



-A(l + S^ {x(8,z),z)Y 



(33) 



where f3 w 2 — 0.7(7 — l)i while A is a normalization constant of order unity which determines the mean flux in 
the considered redshift interval. Eq. ( |33| ) is valid only in redshift-space and the effect of peculiar velocity and thermal 
broadening increases the scatter in the relation between T and <5 IGM (Croft et al. 1998). 

If smoothed on a sufficiently large scale the IGM overdensity can be treated as a linearly fluctuating field. In this 
case the fluctuations of the flux, 5J-, are simply related to the linear baryon density perturbations, 



-APSI GM (x(0, 



(34) 



On large scales observed absorption spectra can thus be used to recover the 3D primordial power spectrum of 
DM perturbations. The standard procedure suggested by Croft et al. (1998, 1999, 2000) inverts the relation in eq. 
( [l3| ) to obtain the 3D power spectrum by differentiating the ID auto-spectrum, 

Alternative methods to measure the amplitude of DM fluctuations and their power spectrum have been proposed by 
Hui (1999) and by Nusser & Haehnelt (1999, 2000). 

However, the 3D power spectrum can also be reconstructed by integrating the ID cross-spectrum ir(k\r±) over 
the transverse separation between LOS pairs. Indeed, by inverse Fourier transforming eq. di"l] ) on the plane spanned 
by ri, and integrating over angles, we find 
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Figure 9. 7Ti? as a function of rj_ (in proper h 1 Mpc at z = 2.15) for 4 different values of fc: k = 0.169, 0.561, 1.005, 1.806 
Mpc -1 for SCDM model (left panel, from top to bottom) and k = 0.125, 0.414, 0.742, 1.333 Mpc" 1 for the ACDM model (right 
panel, from top to bottom). The dashed curve is the theoretical prediction given by eq. (|l5j), not corrected for redshift-space 
distortions. The points are calculated for an ensemble of 30 realizations, 10 realizations for each separation: ri = h~ l Mpc 
(which gives the auto-spectrum) and = 0.1,0.2,0.3 Mpc. 



r poo j 

P(k) = 2-n dr ± r ± J (r ± ^fc 2 - g 2 ) n(q\r ± ) = / — Q(r ± \k,q) , q<k. (36) 

Jo Jo r± 

The above results also lead to a useful 'consistency relation' between the LOS auto-spectrum p(k) and the cross- 
spectra ir(k\r±) along LOS pairs, 

n(k\r±) =p(k) — r± J dqJi(qrj_)p (^/<f + k 2 ^j . (37) 

As a consequence of the assumed homogeneity and isotropy, the RHS of equation (^6|) does not depend on q; we 
can therefore simplify the integral by taking q — k, 

poo 

P(k) = 2tt / dr ± r ± Tv(k\r ± ) . (38) 
Jo 

The flux cross-spectrum TYF(k\r±) is plotted in Figure ^ as a function of r± for a SCDM (left panel) and a 
ACDM (right panel) model, from an ensemble of 30 pairs at different separations. The different set of points are for 
different values of k in the range for which the agreement with theoretical predictions is good. The r± = ft -1 Mpc 
point which represents the auto-spectrum is also shown. The 3D power spectrum can be obtained using eq. (p^). To 
estimate P(k) we can use the values of iTF(k\r±) obtained from the simulations in the analytical expression. Also in 
this case the reconstruction procedure is based on eqs. which neglect the effects of redshift-space distortions, while 
the spectra have been produced in redshift space. 

More generally the redundancy shown by the q-dependence of the integrand in the RHS of eq. ( |36[ ) can be 
exploited to choose a weighting such that the dominant contribution to the integral comes from the separation range 
where the signal-to-noise ratio and/or the number of observed pairs is highest. 

This point is made more clear by Figure where the quantity Q is plotted as a function of the transverse 
separation r± for various values of k and q. In Figure [lo] the solid curves show the theoretical predictions for Q in a 
ACDM Universe: in the left panel for q — k; in the right panel for q — k/3. It is evident that the behaviour is very 
different: for q = k the Bessel function Jo is equal to 1, while for q = fc/3 it acts as a filter which takes both positive 
and negative values. 

To simplify matters we dropped the explicit redshift dependence of the power spectra in the above discussion 
and used power spectra which were averaged over some redshift interval. The recovered 3D power spectrum will then 
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Figure 10. Q as a function of ln(rj_) in ACDM model for three values of k: k = 0.1251,0.4142,0.7422 Mpc" 1 (from top to 
bottom) and for two choices of q: q = k in the left panel and q = k/3 in the right panel. 



generally depend on the median redshift of that interval. Moreover, the assumed isotropy is generally broken, e.g. due 
to the effect of peculiar velocities. A more refined treatment should thus account for various sources of anisotropy in 
the reconstruction procedure by applying for example the techniques proposed by Hui (1999), McDonald & Miralda- 
Escude (1999) and Hui et al. (1999). These and other aspects of the power spectrum reconstruction will be discussed 
elsewhere. 

A reconstruction procedure for the DM power spectrum based on eqs. ( pioj ) or ( |38[ ) has an obvious advantage 
over the standard method based on eq. (^H|): one integrates rather than differentiates a set of generally noisy data. 
To investigate this we have performed the reconstruction of the initial 3D power spectrum in three different ways: 

• differentiation of simulated ID auto-spectra, based on eq. (^); 

• integration of ID cross-spectra, recovered from simulated ID auto-spectra, based on eqs. (^) and (jj^); 

• fitting of simulated ID cross-spectra, based on eq. ( |l^ ) . 

At first we generate 15 LOS and then compute the flux auto-spectra. We choose to smooth the auto-spectra 
with a polynomial function before using eq. (^) to recover the 3D power spectrum. In panel (a) of Figure [H] one 
can see that the agreement with the theoretical prediction is good over a wide range of wave-numbers. The error 
bars represent the scatter over the distribution of the 15 recovered 3D power spectra. Almost all the la error bars 
of the points match the continuous line which represents the 3D linear IGM power spectrum. We stress the fact 
that this technique has been used 'directly' on simulated fluxes, without the use of further assumptions, such as 
Gaussianization (e.g. Croft et al. 1998). 

In panel (b) of Figure [H] we report the results obtained with the second technique based on equations (^) and 
(Bq), using only the auto-spectra information. For each of the 15 flux auto-spectra we compute the cross-spectra for 
a large number of separations via eq. (^). Next we use these estimates in eq. (|38|), i.e. we integrate the ID flux 
cross-spectra along the transverse direction. The agreement with linear theory of this 'integration' technique is very 
good. In this panel the error bars represent the scatter over the distribution of recovered 3D power spectra. The 
results obtained with these two techniques are basically equivalent provided we smooth the simulated data. In a sense 
this method provides a natural choice of a smoothing function. If we apply these two methods directly without any 
smoothing then 'differentiation' is less accurate in recovering the 3D dark matter power spectrum at large scales 
and 'integration' produces smaller error bars. Another significant result is that both methods fail to recover the 3D 
dark matter power spectrum for large wave-numbers (fc<^ 0.1 km -1 s): at small scales, peculiar velocities, thermal 
broadening and nonlinear gravitational effects are responsible for a drop of the flux power spectrum below that 
predicted by linear theory. 



Our third method uses eq. (|l2|): we fit the simulated cross-spectra with a ACDM power spectrum with the 
amplitude as a free parameter and determine the best estimate of the amplitude by \ 2 minimization. We thereby 
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Figure 11. 3D power spectrum reconstruction. In panel (a) points represent the 3D power spectrum obtained with 'differenti- 
ation', i.e. using eq. J35|) ■ In panel (b) points represent the same quantity obtained with 'integration', i. e. using eqs. (^) and 
(|38|). The dotted curve is the 3D power spectrum of the dark matter, while the solid curve is the 3D power spectrum of the 
IGM. 



fitted only the part at small wave-numbers (scales larger than few Mpc) less affected by non-linear effects, peculiar 
velocities and Jeans length smoothing. A further advantage of using the cross-spectra is that in a set of N auto- 
spectra, there are N x (TV — l)/2 cross-spectra, so this second sample could in principle be significantly larger than 
the first. We proceed as follows. First we calculate the error in estimating the amplitude of the 3D power spectrum 
by using only 5 flux auto-spectra. Then, having generated a set of QSO pairs with given separations, we determine 
the minimum number of QSO pairs needed to recover the amplitude with the same accuracy and with a mean 
value compatible within lcr with the amplitude recovered from the 5 flux auto-spectra. We find that from a set of 5 
simulated flux auto-spectra it is possible to determine the logarithm of the amplitude with an accuracy of 1%. We 
find that at large separations one needs more pairs to give an accurate estimate of the amplitude. For separations 
smaller than 2 comoving Mpc we need less than 10 QSO pairs to reach the same accuracy and the same amplitude 
of the auto-spectra estimate. A detailed method that recovers not only the amplitude but also the slope of the 3D 
power spectrum, based on eq. (|l^) will be described in a future paper. 

We conclude from our comparison that with the 'integration' technique the correct slope of 3D dark matter 
power spectrum can be inferred on scales larger than 1 comoving Mpc. This method is complementary to the usual 
'differentiation' and is very accurate. Our 'fitting' technique for the flux cross-spectra of a set of QSO pairs can also 
constrain the 3D power spectrum. 



5.2 Cross-spectra as a means of overcoming limitations of auto-spectra due to continuum fitting 

As discussed by Croft et al. (1998, 2000) and Hui et al. (2000), at scales k < 0.005 km -1 s corresponding to about 
10- 15 ft" 1 Mpc the errors in observed 3D flux auto-spectra as determined by a bootstrap analysis increase rapidly. 
Errors in the continuum fitting procedure are likely be the main contributor. In principle such continuum fitting 
errors can both increase or decrease the amplitude of the flux auto-spectrum. 

Cross spectra should in principle not be affected by any modulation of the flux which is uncorrelated between 
adjacent LOS even though the shot noise will increase with increasing fluctuation amplitude. Continuum fitting should 
thus affect flux cross-spectra much less than flux auto-spectra. 

In this subsection we verify that this is indeed the case and assess how many quasar pairs are needed to extend 
measurements of the DM spectrum to scales as large as 60 h~ x Mpc or more. We first mimic the effect of errors in the 
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Figure 12. Spectra from three QSO pairs with continuum errors added as described in the text 



continuum fitting procedure on flux auto and cross-spectra and then go on to demonstrate explicitly that continuum 
fitting may actually not be necessary for the analysis of flux cross-spectra. 

To mimic the effect of continuum fitting errors we do the following. We take the continuum of the QSO Q1422+231 
kindly provided by Michael Rauch (Rauch et al. 1997). From this we generate a series of spectra with continuum 
fitting errors by changing the amplitude of the Fourier modes of our analytical spectra at all k corresponding to 
scales larger than 15 comoving Mpc, i.e. the scales most likely affected by errors in the continuum fitting procedure. 
We add to the old amplitude a quantity which varies randomly between — 10 % to 10 % of the amplitude of the 
corresponding Fourier mode of the continuum of Q1422 + 231. Then we calculate a new spectrum keeping all the 
phases of the Fourier modes of these continua in order to preserve the characteristic emission lines of the continuum 
of Q1422 + 231. We also shift the spectrum randomly in redshift to avoid correlations due to the charateristic emission 
lines. Similarly we produce spectra with different independent continua by randomly varying the large scale Fourier 
modes of the continuum of Q1422 + 231 by ± -15 %. In the following we will refer to the first set of spectra as the 
no continuum case and to the second as the continuum case. We have simulated spectra of 30 QSO pairs in a ACDM 
Universe with an angular separation of 100 arcsec in this way. 

Examples of spectra of three QSO pairs with 'continuum fitting errors' are shown in Figure [j^ (no continuum 
case). Our 'continuum fitting errors' are clearly visible by eye and are significantly larger than those that should be 
achievable with a careful continuum fitting procedure. 



We have computed flux cross-spectra and auto-spectra for the two samples. The results are reported in Figure 
|l3| , where we plot the value of the flux cross-spectra and auto-spectra computed from a given number of pairs, at two 
fixed values of k = 5 x 10 -4 km -1 s (left panels) and k = 10 -3 km -1 s (right panels). These correspond to scales of 60 
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Figure 13. Flux cross-spectra (triangles) vs. number of QSO pairs. Points represent the flux auto-spectra. The solid curve is 
the theoretical prediction for the IGM linear density cross-spectrum, cq. while the dotted curve is the theoretical prediction 

for the IGM linear density auto-spectrum, eq. (|13|). The left panels are for k = 5 X 10 — 4 km -1 s, corresponding to a scale of 
about QQh~ 1 Mpc; the right panels are for k = 10 -3 km -1 s, corresponding to a scale of about 120 h~ 1 Mpc. The upper panels 
refer to the no continuum case, in which no continuum is present and we mimic the errors introduced by the continuum-fitting. 
The bottom panels refer to the continuum case, in which we directly cross-correlate spectra without removing the continuum. 
Error bars are the error of the mean values. 



and 120 /i -1 Mpc, respectively. The upper panels refer to the no continuum case and the bottom ones to continuum 
one. 

In all four cases shown here the flux auto-spectra (points) give values significantly larger than those of the linear 
prediction (dotted line), as determined by eq. (P_3h. The flux cross-spectra (triangles), however, converge to the right 
mean value (continuous line) obtained from eq. (|15[), although the number of pairs needed is not small. This is due to 
the fact that the fluctuations in the continua of the QSO eventually cancel out when the spectra are cross-correlated, 
while they add in quadrature to the flux auto-spectrum. 

The variations of the flux cross-spectrum around the mean value are due to the combined effect of cosmic variance 
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and the shot noise due to the continuum fitting errors and the continuum fluctuations, respectively. At k = 1CP 3 
km - s the ID flux cross-spectrum can be obtained with an accuracy of 2% in logarithmic units from 30 QSO pairs in 
both the continuum and the no continuum case. It should thus be possible to constrain the rms fluctuation amplitude 
of the matter density at scales of 60 h~ l Mpc with an accuracy of about 15%. Such a measurement should not require 
continuum fitting. At even larger scale the errors for the continuum case start to blow up with 30 QSO pairs. A larger 
number of pairs would be required to beat down the shot noise. 

Attempts to recover the 3D power spectrum on large scales using the flux auto-spectra clearly do not determine 
the correct value if continuum fitting errors are present. As demonstrated above it should, however, be possible to 
overcome this limitation by the use of flux cross-spectra. 

A simple method to recover the right power spectrum for small wave- numbers could be based on eq. (|l2|). We can 
parametrize the power spectrum and then compute the cross-spectra with r± equal to the separation of the pairs. The 
slope of the power spectrum at large scales can then be constrained by \ 2 minimization. A more detailed discussion on 
the use of ID cross-spectra for the determination of the DM power spectrum on all scales will be presented elsewhere. 



6 DISCUSSION AND CONCLUSIONS 

We have presented an effective implementation of analytical calculations of the Lya opacity distribution of the 
Intergalactic Medium (IGM) along multiple lines of sight (LOS) to distant quasars in a cosmological setting. We 
thereby assumed that the neutral hydrogen distribution traces the dark matter distribution on scales larger than the 
Jeans length of a warm photoionized IGM. Simulated absorption spectra with varying transverse separation between 
different LOS have been investigated. We have identified coincident absorption features as fitted with a Voigt profile 
fitting routine and calculated the cross-correlation coefficient and the cross-power spectrum of the flux distribution 
along different LOS to quantify the flux correlation. 

As expected the correlation of the flux along adjacent LOS is sensitive to the detailed shape of the power 
spectrum on scales at and above the Jeans length. We have studied the dependence of the cross-correlation coefficient 
on the shape parameter T of the assumed CDM model and on the Jeans scale which determines the small-scale 
cut-off of the power spectrum of the gas distribution due to pressure of the gas. We have confirmed previous results 
that the characteristic size of the absorbers inferred from simple hit-and-miss statistics of fitted absorption lines 
assuming spherical absorbers depends strongly on the column density threshold used and on the separation of the QSO 
pairs. This reiterates the point that the filamenatary and sheetlike distribution of the IGM suggested by numerical 
simulations makes the concept of spherical absorbers with a characteristic size of very limited use. Nevertheless, we 
obtain values which are in reasonable agreement with those derived from observations of multiple systems by Crotts 
& Fang (1998), Petitjean et al. (1998), D'Odorico et al. (1998), Young et al. (2000). 

The cross-correlation coefficient can be used to define a 'characteristic' correlation length of the absorbers in a 
more objective way. We obtain 0.10 ± 0.04 h' 1 Mpc for SCDM, 0.13 ± 0.03 h" 1 Mpc for rCDM and 0.17 ± 0.05 h' 1 
Mpc for ACDM (all at z — 2.15) as the scale where the cross-correlation coefficient falls to 0.5. This is about the 
Jeans length of the IGM at this redshift. We demonstrate that if the temperature and the slope of the temperature 
density relation can be determined indepedently then the cross-correlation coefficient can be used to constrain the 
shape parameter F in a way which is independent of the amplitude of the power spectrum. 

We furthermore propose a new technique to recover the 3D linear dark matter power spectrum by integrating 
over ID flux cross-spectra. This method is complementary to the usual 'differentiation' of ID auto-spectra and suffers 
different systematic errors. It can be used for the calculation of the cross-power spectrum from a given set of auto- 
power spectra. We show that it is mathematically equivalent to the usual 'differentiation' but offers a natural way of 
smoothing the data in the presence of noise. 

The biggest advantage of the cross-correlation of the flux distribution of adjacent lines is its ability to eliminate 
errors which are uncorrelated in different LOS. The erroneous flux fluctuations introduced by the continuum fitting 
procedure, which is necessary to remove the non-trivial wave-length dependence of the quasar emission, are such 
an error which is largely uncorrelated. We demonstrate that, as expected, such uncorrelated errors affect the cross- 
power spectrum significantly less than the auto-power spectrum. This may render the tedious and somewhat arbitrary 
continuum fitting procedure unnecessary for the recovery of the dark matter power spectrum from the flux correlations 
in adjacent LOS. 

Continuum fitting errors have been the main limitation of using flux auto-power spectra to constrain the DM 
power spectrum at scales larger than about 10h~ Mpc. When flux correlations of adjacent LOS are used the errors 
at large scales will be dominated by cosmic variance, residuals in the removal of the effect of peculiar velocities, the 
uncertainty in the temperature density relation, and possible temperature fluctuations of the IGM which result in 
opacity fluctuations due to the temperature dependence of the recombination coefficient. The errors due to cosmic 
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variance and peculiar velocities will decrease with increasing number of LOS and flux correlations should thus allow 
to extend studies of the DM power spectrum with the Lya forest to significant larger scale than is possible with 
flux auto-power spectra. We estimate that 30 pairs with separation of 1-2 arcmin are necessary to determine the ID 
cross-spectrum at scales of 60 /t -1 Mpc, with an accuracy of about 30% if the error is dominated by cosmic variance. 
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